A Generic Data Harmonization Process for Cross-linked Research and Network Interaction. Construction and Application for the Lung Cancer Phenotype Database of the German Center for Lung Research.
Abstract
OBJECTIVE: Joint data analysis is a key requirement in medical research networks. Data are available in heterogeneous formats at each network partner, and their harmonization is often rather complex. The objective of our paper is to provide a generic approach for the harmonization process in research networks. We applied the process when harmonizing data from three sites for the Lung Cancer Phenotype Database within the German Center for Lung Research.

METHODS: We developed a spreadsheet-based solution as a tool to support the harmonization process for lung cancer data, and a data integration procedure based on Talend Open Studio.

RESULTS: The harmonization process consists of eight steps describing a systematic approach for defining and reviewing source data elements and standardizing common data elements. The steps for defining common data elements and harmonizing them with local data definitions are repeated until consensus is reached. Application of this process for building the phenotype database led to a common basic data set on lung cancer with 285 structured parameters. The Lung Cancer Phenotype Database was realized as an i2b2 research data warehouse.

CONCLUSION: Data harmonization is a challenging task requiring informatics skills as well as domain knowledge. Our approach facilitates data harmonization by providing guidance through a uniform process that can be applied in a wide range of projects.
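The core idea of mapping local data definitions onto common data elements can be illustrated with a minimal Python sketch. It is not the authors' spreadsheet tool or their Talend Open Studio procedure, whose details the abstract does not give. The `Mapping` class, the `harmonize` function, the site names, and all element names and codes are hypothetical and chosen only for illustration. The list of unmapped values stands in for the items that would go back to the sites for review until consensus is reached.

```python
from dataclasses import dataclass


@dataclass
class Mapping:
    """One row of a hypothetical harmonization sheet."""
    site: str            # network partner that owns the local element
    local_element: str   # element name as used at that site
    common_element: str  # agreed common data element
    value_map: dict      # local code -> common code


def harmonize(record, site, mappings):
    """Translate one local record into common data elements.

    Returns (harmonized, unmapped). Entries in `unmapped` have no agreed
    mapping yet and would need another review round.
    """
    by_local = {m.local_element: m for m in mappings if m.site == site}
    harmonized, unmapped = {}, []
    for name, value in record.items():
        mapping = by_local.get(name)
        if mapping is None or value not in mapping.value_map:
            unmapped.append((name, value))
            continue
        harmonized[mapping.common_element] = mapping.value_map[value]
    return harmonized, unmapped


# Hypothetical mappings for two sites (all names and codes are invented).
MAPPINGS = [
    Mapping("site_a", "sex", "gender", {"m": "male", "f": "female"}),
    Mapping("site_a", "smoker", "smoking_status", {"Y": "current", "N": "never"}),
    Mapping("site_b", "geschlecht", "gender", {"1": "male", "2": "female"}),
]

result_a = harmonize({"sex": "f", "smoker": "X"}, "site_a", MAPPINGS)
result_b = harmonize({"geschlecht": "1"}, "site_b", MAPPINGS)
```

In this sketch the unknown smoking code `"X"` from the first site is reported as unmapped rather than silently dropped, so that a gap in the local definitions is visible before the data are loaded into a shared warehouse.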
Related articles
A Generic Data Harmonization Process for Cross-linked Research and Network Interaction
Objective: Joint data analysis is a key requirement in medical research networks. Data are available in heterogeneous formats at each network partner and their harmonization is often rather complex. The objective of our paper is to provide a generic approach for the harmonization process in research networks. We applied the process when harmonizing data from three sites for the Lung Cancer Phe...
Bioinformatics Identification of miRNA-mRNA Regulatory Network Contributing Primary Lung Cancer
Introduction: In clinical practice, distinguishing invasive lung tumors from primary tumors remains a challenge. With recent advances in understanding the biological alterations of tumorigenesis and in molecular analytic technologies, these molecular alterations can serve as sensitive and tumor-specific biomarkers for the stratification of patients. In this study, the molecular network of miRNA-mRNA...
Bioinformatics identification of miRNA-mRNA regulatory network contributing to lung cancer invasion
Background: Over the past 15 years, significant insights have been gained into the roles of miRNAs in cancer. In various cancers, miRNAs can act as oncogenes, tumor suppressors, or control the metastasis process by modulating the expression of numerous target genes. This study is aimed at determining molecular network of miRNA-mRNA regulating lung cancer invasion, by bioinformatics approaches. ...
Gene Regulation Network Based Analysis Associated with TGF-beta Stimulation in Lung Adenocarcinoma Cells
Background: Transforming growth factor (TGF)-β is over-expressed in a wide variety of cancers such as lung adenocarcinoma. TGF-β plays a major role in cancer progression through regulating cancer cell proliferation and remodeling of the tumor micro-environment. However, it is still a great challenge to explain the phenotypic effects caused by TGF-β stimulation and the effect of TGF-β stimulatio...
Identification of a Novel Tumor-Binding Peptide for Lung Cancer Through in-vitro Panning
Tumor-targeted therapies are playing growing roles in cancer research. The exploitation of these powerful therapeutic modalities largely depends on the discovery of tumor-targeting ligands. Phage display has proven a promising high throughput screening tool for the identification of novel specific peptides with high binding affinity to cancer cells. In the present study, we describe the use of ...
Journal title:
- Methods of Information in Medicine
Volume 54, Issue 5
Publication date: 2015